SITE LINK

KMID : 0917520000070010017

Journal of Speech Sciences
2000 Volume.7 No. 1 p.17 ~ p.29

Speech Quality of a Sinusoidal Model Depending on the Number of Sinusoids

Seo, Jeong Wook
Kim, Ki Hong/Seok, Jong Won/Bae, Keun Sung

Abstract

The STC(Sinusoidal Transfrom coding) is a vocoding technique that uses a sinusoidal speech model to obtain high quality speech at low data rate. It models and synthesizes the speech signal with fundamental frequency and its harmonic elements in frequency domain. To reduce the date rate, it is necessary to represent the sinusoidal amplitudes and phases with as small number of peaks as possible while maintaining the speech quality. As a basic research to develop a low-rate speech coding algorithm using the sinusoidal model, in this paper, we investigate the speech quality depending on the number of sinusoids. By varying the number of spectral peaks from 5 to 40 speech signals are reconstructed, and then their qualities are evaluated using spectral envelope distortion measure and MOS(Mean Opinion Score). Two approaches are used to obtain the spectral peaks: one is a conventional STFT (Short-Time Fourier Transform), and the other is a multiresolutional analysis method.
Keywords : speech coding, STC, speech quality, sinusoidal model

KEYWORD

FullTexts / Linksout information

Listed journal information

site infomation

Prohibition of Unauthorized Collection of E-mail Addresses, medric.kyung@gmail.com
N4 301, Chungbuk National University, Chungdae-ro 1, Seowon-Gu, Cheongju, Chungbuk 28644, Korea